Text File | 1993-01-04 | 2.3 KB | 52 lines | [TEXT/QED1] |
- This describes "Charset Options..." (String Nucleotides menu item of
- Convert menu).
-
- Problem: It is difficult to create or keep track of PAUP Charset/Exset
- blocks.
-
- Creating: Use "Create" option and a "label" line included in your
- alignment (in string format). The following example is
- included in the file, "Label_Line.str":
-
- aa *********&**********************&&&&****************
- ab acttacttggacacttacttggacacttacttggacacttacttggacactt
- ac acttacttggacacttacttggacacttacttggacacttacttggacactt
- ad acttacttggacacttacttggacacttacttggacacttacttggacactt
-
- Here, a label line "aa" was used with the same format and length as the
- real strings. Any symbol can be used in a label line except a space. If
- you are going to "match" your sequences to the first sequence (e.g., in
- Aligner), also avoid dashes and periods and put the label line last. In
- the present example, the symbol "&" might be used to mark sites of
- interest, perhaps sites one wishes to exclude or weight differently.
- The "Create" option prompts for the number of the line to use as the
- label line (so you can keep as many label lines as you want for
- different purposes), then for the symbol to look for (e.g. "&"), and
- finally for the character number of the first site. Use "1" unless
- you are combining data sets and want the charset/exset automatically
- adjusted to start at some higher character number. A new file is then
- created containing the charset/exset block.
- The sample file "Sample_Exset" is what would be "created" using the
- file "Label_Line.str" and the "Create" option:
-
- exset ExclSites = 10 33-36 ;
-
- If you want a "charset" rather than an "exset", you need to substitute
- "charset" for "exset". They are of similar utility in PAUP but the
- syntax for referring to them is slightly different.
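-
- For readers who want to check a block by hand, the "Create" step can
- be approximated by the short Python sketch below. It is not the
- program's own code: the function names (label_sites, to_ranges,
- make_block) are hypothetical, and it assumes sites are numbered from
- the first character of the label string, offset by the "first site"
- value.

```python
def label_sites(label, symbol, first_site=1):
    """Site numbers where the label string carries the chosen symbol.

    first_site is the character number assigned to the first column
    (assumed to work as a plain offset when combining data sets).
    """
    return [i + first_site for i, ch in enumerate(label) if ch == symbol]


def to_ranges(sites):
    """Collapse sorted site numbers into tokens such as '10' or '33-36'."""
    tokens = []
    start = prev = None
    for s in sites:
        if start is None:
            start = prev = s
        elif s == prev + 1:
            prev = s  # still inside a run of consecutive sites
        else:
            tokens.append(str(start) if start == prev else f"{start}-{prev}")
            start = prev = s
    if start is not None:
        tokens.append(str(start) if start == prev else f"{start}-{prev}")
    return tokens


def make_block(label, symbol, first_site=1, keyword="exset", name="ExclSites"):
    """Build a one-line block in the style of the Sample_Exset file."""
    tokens = to_ranges(label_sites(label, symbol, first_site))
    return f"{keyword} {name} = {' '.join(tokens)} ;"


# Label line "aa" from Label_Line.str (52 characters).
label = "*" * 9 + "&" + "*" * 22 + "&&&&" + "*" * 16
print(make_block(label, "&"))
```

- Passing keyword="charset" gives the charset form of the same block,
- and a first_site greater than 1 shifts every site number upward.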
-
- Extracting: If you now want to use this exset (or charset) to
- extract particular sites from your aligned strings,
- use the "Extract" option. Once you have specified the
- input file with the string data and the output file,
- and have selected "Extract", you are prompted for the
- file containing the charset or exset (sorry, I know
- this is awkward). In the above example, you would
- extract the following:
-
- aa &&&&&
- ab gggac
- ac gggac
- ad gggac
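-
- The extraction above can be reproduced with a small Python sketch.
- It is an illustration only, not the program's own code: parse_sites
- and extract are hypothetical names, and the parser assumes a one-line
- block of the form shown in Sample_Exset, with single sites and
- "start-end" ranges.

```python
def parse_sites(block):
    """Read site numbers from a block like 'exset ExclSites = 10 33-36 ;'."""
    spec = block.split("=", 1)[1].rstrip().rstrip(";")
    sites = []
    for token in spec.split():
        lo, _, hi = token.partition("-")
        # A bare number is a single site; 'lo-hi' is an inclusive range.
        sites.extend(range(int(lo), int(hi or lo) + 1))
    return sites


def extract(rows, sites):
    """Keep only the listed (1-based) sites from each (name, string) row."""
    return [(name, "".join(seq[s - 1] for s in sites)) for name, seq in rows]


# The alignment from Label_Line.str, rebuilt here for the example.
seq = "acttacttggac" * 4 + "actt"
rows = [
    ("aa", "*" * 9 + "&" + "*" * 22 + "&&&&" + "*" * 16),
    ("ab", seq),
    ("ac", seq),
    ("ad", seq),
]

for name, sub in extract(rows, parse_sites("exset ExclSites = 10 33-36 ;")):
    print(name, sub)
```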
-
-